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ARTIFICIAL TRANSCRIPTIONAL FACTORS AND METHODS OF USE 

Government Funding 
This invention was made in part with government support under grant 1R41 
GM57712-01 awarded to the inventor by the National Institute of General and Medical 
Sciences. The government has certain rights in the invention. 

Cross-Reference to Related Applications 
The application claims priority under 35 U.S.C. § 1 19(e) from and is a continuation in 
part of provisionally filed application U.S.S.N. 60/240,479, filed on October 13, 2000, and 
entitled "Artificial Transcriptional Factors and Methods of Use. 5: 

Background of the Invention 
This invention relates to regulation of gene expression, and in particular, to the design 
and synthesis of compositions useful as artificial transcriptional factors (ATFs). The 
invention also relates to the use of such molecules novel therapeutics that regulate gene 
expression at the level of RNA transcription, as tools for gene regulation and for target 
validation in functional genomics, and in pharmaceutical drug development. 

A crucial step in eukaryotic genome regulation involves activation and repression of 
RNA synthesis by transcription factors. Transcription factors are modular proteins that 
contain at least two functional parts: a DNA-binding domain and an activation or repression 
(i.e. "effector'-) domain (Ptashne, "A Genetic Switch: Phage Lambda and Higher Organisms. 
Cell and Blackwell Scientific, Cambridge, MA, (1992): and Ptashne et al., Nature 386:569- 
577 (1997)). Typically, the DNA-binding domain anchors the transcription factor to the 
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promoter through interaction with specific DNA sequences. Effector domains participate- in 
interactions with other proteins involved in RNA transcription such as TATA-BOX binding 
protein (TBP) and TBP-associated proteins, proteins comprising RNA Polymerase II 
holoenzyme. coactivators. compressors, histones and histone-modyfying enzymes, and others 
5 (Blau : J el al. 3 KM Cell Biol 16, 2044-55 (1996); .Brown, S. A., el ah EMBO J 17, 3146-54 
(1998); Collingwood, T. N. : Urnov, F. D. & Wolffe, A. P.JMpl Endocrinol 23, 255- 
75(1999)). These protein-DNA and protein-protein interactions facilitate or inhibit the 
assembly of the transcriptional apparatus at the promoter and, therefore, activate or repress • 
the synthesis of RNA encoded by the regulated gene (Koleske et al.. 'The RNA Polymerase 

10 II Holoenzyme And Its Implications For Gene Regulation". Trends Biochem. Sci. 20:1 13-1 16 
(1995): Hampsey : M. & Reinberg : D. Curr Opin Genet Dev 9, 132-9 (1999)). 

Experimental data reported thus far strongly indicates that effector domains are 
functionally independent from DMA-binding domains. For example, an activation domain of 
one transcription factor can be attached to the DNA-binding domain of a different factor. 

1 5 Such hybrid molecules retain their characteristic functions across the range of organisms, 
from yeast to human ceils, thus indicating that the basic regulator) 7 mechanisms are common 
to diverse eukaryotes (Brent et aL Cell 43:729-736 (1985)). Therefore, the DNA-binding 
and effector domains could be considered as completely separate functional and structural 
entities joined together as a bifunciional molecule. 

20 Conceptually, the modular structure of transcription factors implies that each 

transcription factor molecule could be simply regarded as a bi-dentate ligand : where one of 
the ligands (DNA-binding domain) binds DNA in a sequence-specific manner and other 
ligand (the ;; effector" domain) binds a protein thai participates in transcriptional regulation 
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(activation or repression). This simplified model raises a possibility of replacing the DNA- 
binding and/or effector domain with synthetic chemical moieties to yield a novel class of 
molecules - synthetic or artificial transcription factors (ATFs). The main goal of the ATF 
molecular design is to introduce new 5 drug-like chemical properties such as a lower 
molecular weight resistance to enzymatic degradation and cell membrane permeability while 
preserving the crucially important biological function - the ability to regulate KN A 
transcription of specific genes. 

There have been several recent attempts to combine the minimal activation domain of ■ 
strong transcriptional activators like VP16 or GCN4 with artificial DNA-binding domains 
based on a triple-helix forming oligonucleotide or a pyrrole oligomers. These efforts have 
yielded molecules having a barely detectable activity (Kuznetsova et al., Nucleic Acids Res. 
27:3995-4000 (1999)) or showing only a weak activation limited to in vitro (cell-free) assays 
(Mapp et al, Proc. Natl. Acad. Sci. USA 97:3930-3935 (2000). In general, the activity of 
such hybrid molecules is extremely weak (and possibly non-specific) compared to that of the 
natural (or protein) activators from which portions of these molecules were derived 
(Sadowski et al., Nature 335:563-564 (1988)). 

There remains a need for the identification of novel transcription activators and 
improved activation systems. In particular, there remains a need for artificial transcription 
activators that match or exceed the activation of strong natural transcriptional activators. 
There also remains a need for novel transcription repressors and repressor systems. In 
general, there remains a need for novel transcription systems that can modulate the 
transcription process. The molecules would open the possibility for the first practical 
applications of the technology in biology and medicine. 
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Summan' of the Invention 
The present invention relates to a molecular design resuliing in highly potent ATFs. 
as is demonstrated both in cell-free assays (in vitro) and in vivo (in tissue culture cells). 
5 These ATFs may be used as tools for gene regulation at the level of transcription and as a 
class of gene targeting pharmaceuticals for treatment of cancer and many other diseases. 

In one aspect of the invention, a composition for modulating transcription of a 
eukaryotic gene is provided. The composition includes a non-peptidic DNA binding domain.- 
a flexible iinker. and a transcriptional effecior. one end of the linker being covalently bound 
10 to the DNA binding domain, and the other end of the linker being covalently bound to the 
transcriptional effector. 

In at least some embodiments, the effector activates transcription. In at least some 
other embodiments, the effector represses transcription. In at least some embodiments, the 
flexible linker of the ATF of the invention is at least 10 A . or at leas- 1 5 A in length or. in at 
15 least some other embodiments, at least 28 A in length. In at least sum- embodiments, the 
flexible linker has a length in the range of 10-100 A. In at least sonu- embodiments, the 
flexible linker has a length in the range of 15-25 A, or in the ran:v «.j 25-40 A. or in the 
range of 40-60 A, or in the range of 60-100 A. 

In at least some embodiments, the composition for mod;iu:n..- ^inscription of a 
20 eukaryotic gene binds a co-modulating protein. In at least sunk c:nf-. ■.::::i-jnu>. the 
composition represses transcription and binds a histone or hish.x- n... mi- protein. 

In at least some embodiment, the DNA binding domain i> in.^Ww ;icid. In at least 
some embodiments, the nucleic acid is a modified nucleic acid. In jnmhv: embodiment, the 
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nucleic acid includes a modified backbone, or the nucleic acid contains modified bases. In 
still other embodiments, the modified backbone comprises substitutions for phosphodiester 
bonds selected from the group consisting of phosphorothioates and peptide nucleic acids. In 
other embodiments, the linker is bound to the 3' or the 5 ! end of the nucleic acid. 

In at least some embodiments, the DNA binding domain is a peptidic nucleic acid. In 
some embodiments, the DNA binding domain does not contain a plurality of pyrrole or 
imidazole groups. In at least some embodiments the DNA binding domain is a sequence- 
specific DNA-binding natural product. 

In at least some embodiments, the transcriptional modulator is a polypeptide 
sequence. In some embodiments, the transcriptional modulator is an polypeptide activator 
and the polypeptide has a polypeptide sequence including at least one copy of an activator 
sequence of amino acids from Herpes simplex viral protein VP16 comprising SEQ ID NO: 1 
(see, Table 1.) (GSDALDDFDLD). In at least one embodiment, the polypeptide has the 
sequence of S WQUID NO: 12. In at least some embodiments, the peptide sequence includes 
two copies of the activator sequence of SEQ ID NO: land a cysteine residue at an amino 
terminus of the peptide sequence, the peptide having the sequence of SEQ ID NO: 2 (see, 
Table 1.) (CGSDALDDFDLDGSDALDDFDLD). In at least some embodiments, the 
polypeptide sequence includes a cysteine residue and two copies of the activator sequence of 
SEQIFNO.-12, the peptide having the sequence of SEQ ID NO:8. In at least some 
embodiments, the amino-terminus of the polypeptide transcriptional modulator is covalentiy 
bound to the linker. In at least some embodiments, the carboxyl-terminus of the polypeptide 
transcriptional modulator is covalentiy bound to the linker. In at least some embodiments, a 
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cysteine residue of the polypeptide transcriptional modulator is covalemly bound to the 
linker. 

In at least some embodiments, the transcriptional effector moiety of the composition 
for modulating transcription of a eukaryotic gene has a molecular weight of less than about 
5 3,000 daltons. In at least some other embodiments., the transcriptional effector moietv 

molecular weight of less than about 1.500 daltons. In at least some other embodiments, the 
transcriptional modulator moiety molecular weight of less than about 1.000 daltons. 

In at least some embodiments of the composition for modulating transcription of a ' 
eukaryotic gene, the flexible tinker includes a polyglycol. In some embodiments, the flexible 
10 linker polyglycol contains at least two, or at least four : or at least six ; glycol units. In another 
embodiment, the flexible linker includes a plurality of monomer units selected from the 
group consisting of nucleotides, peptides, and lower alkyls or other oxygen-containing alky] 
chain derivatives. 

In at least some embodiments, the amount of transcription initiated on a double- 
15 stranded DNA template is at least ten-fold greater compared to a second amount initiated in 
the absence of the composition. In at least some embodiments, the amount of transcription 
initiated on a linear double-stranded DNA template is at least 20-, or at least 30-. or at least 
40-, or at least 50-fold greater compared io the second amount in the absence of the 
composition. In at least some embodiments, the amount of transcription is 30-50 times 
20 greater than transcription in the absence of the ATF of the present invention. 

In at least some embodiments of the invention, the DNA binding domain of the 
compositions for effecting transcription (modulating) of a eukaryotic gene has affinity for at 
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leasi one DNA binding site on a DNA template, which DNA template is less than aboui 500 
base pairs in length. 

In another aspect of the invention, a composition for activating transcription has the 
structure A-B-C, wherein A is a triplex-forming nucleic acid, B is a flexible linker, and C is 
5 an activation moiety that binds to a site on a transcriptional protein complex comprising an 
eukaryotic RNA polymerase, wherein B is covalently linked to A and C. In at least some 
embodiments, B is a polyglycol chain, and the covalent linkage of B to C includes a 
bifunctional crosslinking agent. In at least some embodiments, the linker, B/is a polyglycol 
of at least about 28 A in length, and C comprises an amino acid sequence from Herpes 

1 0 simplex viral protein VP 1 6. 

Another aspect of the invention provides a method for assaying a test compound for 
activity as a transcriptional modulator. The method includes linking the test compound 
covalently to a flexible linker domain which is covalently bound to a non-peptidic DNA 
binding domain to provide a test composition, the DNA binding domain having affinity for a 

15 DNA binding site on a DNA template sufficient to bind the site and to modulate transcription 
at a promoter; contacting the test composition with a transcription mixture including a DNA 
template, a eukaryotic RNA polymerase molecule capable of Ionium; a complex with the test 
composition and the DNA template, a buffer and substrates untie: conditions suitable for 
RNA synthesis, such that RNA is synthesized: and determining tt- quantity of RNA 

20 produced in the presence of the test composition compared to a ha v.! ic\ d in the absence of 
the test composition, which is a measure of the activity of the le a o»n;ii»Miiun as a 
transcriptional modulator. In at least some embodiments, the DNA hndmL' site is a plurality 
of repeats of the binding site sequence .In at least some embodiment of the method for 
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assaying a lesi composition for activity as a transcriptional activator, the binding site of the 
test composition to the DNA template is located within 100 base pairs of the site for initiation 
of transcription.??????????'???? 

In another embodiment of the method for assaying a test composition for activity as a 
5 transcriptional activator, the step of providing the test composition with a transcription 
mixture is performed in vivo. In at least some embodiments., the test composition is pre- 
bound to the template., and the complex is provided to a plurality of cells by transformation. 
In at least some embodiments, the test composition is provided io the cells that cany- a 
reporter plasmid template incorporated into the chromosome (stable transfection lines). . 

10 In at least some embodiments, the step of providing the test composition with a 

transcription mixture is performed using high throughput screening technologies comprising 
robotized distribution into wells of multiwell dishes. In at leasi some embodiments, the step 
of determining the quantity of transcription is performed using high throughput methods of 
detection comprising automated plate readers having programmable computerized programs 

1 5 for data analysis and display. 
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Brief Description of the Drawings 
FIG. 1 shows a pictorial illustration of the tripartate ATF of the invention. The ATF 
has three component parts, a DNA binding domain (designated A), a flexible linker (B), and 
an effector domain, for example, an activator domain (C). 
5 FIG. 2 shows (A) an ATF of the present invention and (B) one possible scheme for 

synthesis of an ATF in at least some embodiments of the invention. 

FIG. 3 shows the promoter regions of the transcription templates, wherein: (A) the 
control template contains in promoter five GAL4 and five ATF binding sites Incorporated in . 
the promoter at -53 and -155 bp relative to the +3 transcription start site, respectively; and 
10 (B) the ATF assay template contains five ATF binding sites incorporated at -65 bp. 

FIG. 4 shows an acrylamide gel eiectrophoretogram of the run-off transcription 
products in the presence and absence of an ATF (as described in Figure 2 A), with the 
contents of each lane identified as follows: lane 1 shows the low level of basal transcription 
from the control transcription template of Figure 2A; lane 2 shows the 250 base transcript 
15 activated by the GAL4-YP16 fusion protein on the control template; lane 3 shows basal 
transcription from the ATF assay transcription template of figure 2B; lanes 4-7 show the 
effect of the ATF of Figure 1 on transcription from the ATF assay transcription template, 
with lane 4 having the smallest amount of ATF, and lane 5 having five-fold more ATF than 
lane 4, lane 6 having 100-fold more ATF than lane 4, and lane 7 having 500-fold more ATF 
20 than lane 4. Lanes 8 and 9 show the effect of an ATF having the polylinker, with covalently 
attached activation domain, the polylinker being attached to the 3' terminus of the DNA 
binding domain, on transcription of the ATF test template. Lane 9 has twenty-fold more 
ATF than lane 8. 
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FIG. 5 shows an acrylamide gel electrophoeragram of the run-off transcription 
products for transcriptional activators GAL4-VP16. 3 : ATF. 5'ATF(D) and 3*ATF(D). 

FIG. 6 shows (A) reporter, constructs used in co-lransfection assays/in which the 
transcription templaie contains five ATF binding sites in the promoter. (B) a representative 
CAT assay whereby BHK-21 were co-transfected with transcription template alone (lanes 1 
and 2) or in combination with ATF : s (lanes 3-6). and (C) a bar graph of the mean fold 
transcriptional activation by ATFs for three experiments, each performed in duplicate; error 
bars represent SEM (n=3). 

Detailed Description of the Invention 
Definitions 

The following definitions are used throughout this description and in the claims, 
unless the context otherwise requires. The patents and scientific literature referred to herein 
establishes the knowledge that is available to those of skill in the art. The issued U.S. 
patents, allowed applications, and references cited herein are hereby incorporated by- 
reference. 

As used herein the term ''artificial transcription factors'' includes synthetic 
transcription factors. 

The term t; abour is used herein to mean a numeral value having a range, of ± 20% 
around the cited value. 

The term "transformation" refers to a genetic eveni used in construction of a cell line 
or sirain. resulting from mixing recipient cells with DNA such that the DNA enters ai least a 
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portion of the cells, and includes transfection, lipofection or other liposome-mediated 

process, and eiectroporation. 

A "transcriptional effector" refers to a chemical composition which when present in 

the vicinity of a promoter, and bound to a DNA binding domain, causes an increase or 

decrease in quantity ofRNA synthesized from a particular promoter or class of promoters. 

Transcription in the absence of an effector is said to be at a "basal" level. Transcription can 

be activated (also known as induced, or up-regulated) by a positive effector or "activator". 

Similarly, a basal or activated level can be repressed, or down-regulated, by a negative 

effector or "repressor". 

A transcriptional effector can act near or at the site of initiation of transcription of a 

gene. Genes are transcribed when RNA is synthesized in a 5' to 3' direction using a strand 
of DNA as a template. The site of initiation of transcription, in which a first ribonucleoside 
triphosphate complexes with the RNA polymerase, occurs complementary to a site on the 
DNA template known as "+1 ", with each successive nucleotide addition occurring 
complementary to "downstream" sites with increasing positive numbers. "Upstream" of the 
+1 site are generally found the DNA regulatory signals, such as the promoter, that specify 
binding of the transcription factors, components of the transcriptional machinery, RNA 
polymerase and associated proteins. A promoter is generally found within a fixed distance 
upstream of the +1 site, to position the RNA polymerase holoenzyme appropriately for 
transcription initiation. Regulatory signals in the DNA sequence that modulate the amount of 
transcription, such as the GAL-4 binding site sequence originally discovered in yeast, are 
generally located in the promoter, for example, upstream of and adjacent to the +1 site. 
Nucleotide and amino acid sequences referred to herein are listed in Table 1. 
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SEO ID NO 


| Sequence 


1 1 
i „ 


NH 2 -GSDALDDFDLD-COOH 


i 2 


1 NH ; - CGSDALDDFDLDGSDALDDFDLD-COOH ~* 


3 


1 NH 2 - FLFQLPQQTQGALLTSOP-COOH 


4 


! NH 2 - WAVYELLF-COOH 1 


5 


5 TTGTGGTGGGTGGGGTGTGGGT3 ' 


o 


:> I lul ub 1 ubu 1 (j(ju(j 1 GTGGGTX\ C3 


i 7 


5 1 C YXCTTGTGGTGGGTGGGGTGTGGGT3 ' i 


is 


NH2-CGSDALDDFDLDMLGSDALDDFDLDMLGS-COOH 


19 


5'GTTCTCCTCCCTCCCCTCTCCCTCTT3' 


! 10 


3 "AAGAGGAGGGAGGGGAGAGGGAGAAG5" 


11 


-vTTCTCCTCCCTCCCCTCTCCCTCTT3' 


12 


CGSDALDDFDLDML 



A means alanine; C means cysteine; D means aspartic acid; E means glutamic acid; F 



means phenylalanine; G means glycine; H means histidine; I means isoieucine; K means 
lysine; L means leucine; M means methionine; N means asparagine; P means proline; Q 
means glutamic acid: R means arginine: S means serine; T means threonine; V means valine; 
W means tryptophan; and Y means tyrosine. 



Detailed Description of Embodiments 
Ii has been surprisingly discovered that a tripartite ATF ? in which no part contains an 
intact folded, protein domain, can function as well as or better in activating transcription 
than the protein transcription modulators from which the design of these portions are taken. 
The design of the tripartite molecule is such that the ATF contains a DNA binding domain, a 
flexible linker, and a transcriptional effector, thai activates or represses or otherwise modifies 
transcription, also referred in the description and the claims as A. B : and C. respectively. 
None of these "domains.." however need be a protein domain. The meaning of "domain," as 
used herein, refers 10 ihe function of each of the three parts of the ATF molecule. 
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In one aspect of the invention, synthetic or artificial transcription factors (ATFs) are 
provided with activity comparable or even exceeding that of the natural transcriptional 
activator proieins to which certain ATF components are related. In some examples of the 
invention, the transcription response, i.e., either activation or repression, is 10 times, or 20 
5 times, or 30 times or 40 time or even 50 times greater than that of the controls, i.e., a system 
in the absence of the ATF of the invention. 

An embodiment of the ATF of the invention is illustrated in FIG. 1 . The functional 
ATF includes a DNA binding domain A, shown here as a triplex forming oligonucleotide 
(TFO) bound at the 5' position to a linker B. It is possible, however, to bind the linker at the 
10 3' end of the DNA binding domain with no loss in activity. The linker is joined at its distal 
end to the effector domain, C. i.e., a domain which has an effect on transcription. The effect 
may be to activate or repress transcription. Here, the effector is shown as an activating 
domain, AD. As is described in greater detail below, the individual domains of the ATF 
molecule are most typically covalently bound; however, it is contemplated that any 
15 appropriate association, i.e., covaient bonding, hydrogen bonding, hydrophilic or 
hydrophobic association, may be used to form the ATF. 

The DNA binding domain, A, is any non-peptidic moiety with affinity for a specific 
recognition site within the promoter DNA. By non-peptidic moiety, it is meant that the 
domain does not include a substantial amount of a natural amino acid. Substantially 
20 excluding peptidic components in (he DNA binding domain does not exclude the possibility 
of isolated inclusion of amino acids. For the purpose of this invention, substantially non- 
peptidic shall mean less than 50 % or less than 20 % of natural amino acid content. The 
choice of DNA binding domain depends on the gene intended to be activated. The DNA 
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binding domain recognizes a siie this is rypically positioned relatively near to the 
transcriptional start site of the gene for which the activator can affect transcription, although 
some activators may be able to act over long distances. Many activators or repressors are 
able to act over long distances and use of these effectors is contemplated in the invention. 

In at least some embodiments, the DNA binding domain is an oligonucleotide, such 
as a triplex forming oligonucleotide (TFO). These moieties are thought to bind in the major 
groove of the DNA helix. Design of triplex forming nucleic acids is described in U.S. patent 
number 5,874,555 to Dervan, et al, which is incorporated herein by reference: It is 
anticipated that additional types of DNA binding domains can be substituted for triplex- 
forming nucleic acids. 

TFOs are able to form strong and stable sequence-specific complexes with double- 
stranded DNA at physiological conditions (Maher et al... Biochemistry 31:70 -81 (1992)). 
This property is not adversely affected upon conjugation with effectors, as is established in 
the examples below. Although not being bound to any mode or theory of operation, it is 
possible that the linker plays a role, among other possible roles of the linker, in "insulation" 
of the TFO from interference by the attached chemical groups. 

The triple helix formation between the TFO and DNA is facilitated when sequences 
of both molecules have special features, most notably uniform stretches of purine bases in 
one DNA strand and pyrimidine in the complementary strand. This is not an obstacle for 
many applications such as pharmaceutical (gene targeting) applications or in vitro and in vivo 
ATF assays because the sequences on TFOs and its corresponding binding sites in any 
promoter of a reporter gene can be chosen at will. In addition, such sites have been shown to 
exist in native genes, most often in the regulatory regions of promoters (Svinarchuk et al.. 
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Nucleic Acid Res. 22:3742-3747 (1994); Nakanishi et al., Nucleic Acid Res. 26:5218-5222 
(1998): and Bramachari el al, Gene 190:17-26 (1997)). In addition, TFOs that are 
chemically modified may recognize many additional kinds of DNA sequences. For example 
ii is now possible to target a polypurine stretch interrupted by several pyrimidine residues 
with the "bridged" or clamped TFOs (Helene et al, Ciba Found. Symp. 209:94-102(1997)). 
Also, the triplex recognition scheme can be extended by synthesizing TFOs with nonnaturai 
bases and nucleotide analogues. TFO-based ATF may take advantage of the recent 
improvement and availability of genomic databases that makes it possible toidentify 
convenient TFO-binding sites in promoters. In one application of the present invention, 
genes of medical interest and suitable binding sites are identified and a TFO sequence having 
a suitable affinity therefore is synthesized. 

In addition to TFOs, there are other classes of molecules that can be used as artificial 
(non-peptidic) DNA-binding domains. In at least some embodiments of the present 
invention, the DNA binding domain is a peptide nucleic acid (PNA), a molecular analog of 
DNA in which the phosphate backbone is replaced with a backbone similar to that found in 
peptides. Peptide nucleic acids can bind to single-stranded DNA by Watson-Crick base 
pairing and can form triple helices to DNA/PNA duplexes much in the way of nucleosides 
(Nielsen ei al., J. Molec. Recognii. 7:165-70 (1994); Egholm et al, Nature 365:566-8 (1993); 
and Kim et al., Nucleic Acid Res. 27:2842-7 (1999)). A PNA "clamp" consisting of two 
PNA strands connected with a flexible linker can form a very stable complex with DNA 
duplex, and can be designed to target similar kind of polypurine and polypirimidine DNA 
sequences as TFOs. Recently a new generation of PNAs called "pseudo-complementary 
PNAs" (pcPNAs) were synthesized (Izvolsky et al. Biochemistry 39: 10908 (2000)). These 
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PNAs target the designated sites on DNA that contain mixed sequence of purines and 
pyrimidines via double duplex invasion mode. Since the backbone of PNA is noi chareed. 
ihe lack of electrostatic repulsion leads to the formation of strong and stable complexes with 
DNA. Also, PNA has a smaller mass per monomer unit than DNA and is generally resistant 
5 to degradation by enzymes thai can attack the phosphate backbone of an oligonucelotide. 
These and other properties make PNA a very attractive choice for ATF DNA-binding 
domain. 

In another embodiment, the DNA binding domain may be a peptide analog, such as 
polyamides. e.g.. polypyrroles and polyimidazoles. described in United States Patent No. 

]0 5,874.555. These moieties are thought to bind in the minor groove of the DNA helix and the 
activation observed for ATF using a polyaraide DNA binding domain is not as great as for 
ATFs having a oligonucleotide DNA binding domain. It is though! that the nature of binding 
of the DNA binding protein within the minor groove of the DNA double strand may limit the 
ability of the attached transcription effector to interact efficiently with the proteins 

15 comprising the transcription machinery. 

In still other embodiments, the DNA binding domain is a non-protein binding domain 
including sequence-specific DNA-binding natural products such ;in uniihiotics or other 
moieties using small organic material. 

The effector domain, C. can be a positive (an activaion r: \:\\\i\\\>c in repressor) 

20 modulator of the amount of basal level of transcription (mllNAi M.::.;. Unuiional activator 
and repressor peptides arc known and may be used in the ATF m<-\ u.v \>\ \\vc present 
invention, in some embodiments; the effector domain includes ;s Mni.r-k- chemical coupling 
group (e.g.. an amine, carboxyl or thiol group) thai allows for the iorm:nn»ii of* the covaient 
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bond between the effector and the linker (FIG. 1). In other embodiments, the effector 
domain, linker and DNA binding domain (or a sub-component thereof) are synthesized as a 
single molecule, so that no additional chemical coupling group is provided. 

In some embodiments, the effector domain is most typically a peptide, selected for its 
ability to up or down regulate transcription. The peptide may contain either L- or D-amino 
acids. Exemplar}', but non-iimiting examples of activating effectors include VP- 16, Oct-2, 
and active fragments of VP- 16 and Oct-2. In at least some embodiments, a 29-mer or a 14- 
mer of the VP- 16 peptide has been shown to have an exceptionally strong activation effect. 

In at least some embodiments, shorter peptides that include much, if not all, of the 
activation function of the longer peptide may be used as a part of the ATF molecule. Shorter 
peptides may be identified by determining the shortest peptide segment of the activator 
domain, e.g., the "core'" activatore, that remains functional in in vitro or in vivo assays in 
order to minimize the size and mass of the effector domain. 

In at least some embodiments of the invention, peptides derived from other general types of 
activation domains, such as a glutamine rich domain of Oct-2,are used as the effector 
domain. It has been shown that 18 amino acid peptide SEQ ID NO: 3 
(FLFQLPQQTQGALLTSQP) forms a core activation sequence of Oct-2 (Tanaka, M., 
Clouston, W. M. & Herr, W. Mol Cell Biol 14, 6046-55 (1994); Tanaka, M. & Herr, W. Mol 
Cell Biol 14, 6056-67(1 994)).Two or three tandem repeats of this sequence form a functional 
activation domain. A recent report demonstrates that an 8-amino acid peptide sequence SEQ 
ID NO: 4 (WAVYELLF) forms a relatively strong activation domain when fused to the 
protein DNA-binding domain(Frangioni, J. V., LaRiccia, L. M., Cantley, L. C. & Montminy, 
M. R. Nai Bioiechnol 18, 1080-5 (2000)). 
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In at least some embodiments of ihe invention, the effectiveness of ATFs is enhanced 
by combining the minimal activation domains from different activators in a sinaie effector. 
This lakes advantage of cooperative effects in transcriptional activation. For example, two 
different activators binding the same promoter produce a proportionately stronger effect than 
5 each of them acting alone, perhaps by having different classes of activation domains contact 
different coactivators involved in transcriptional initiation (Blau, J., Xiao, H. : McCracken, S.. 
O'Hare, P., Greenblatt, J. & Bentley, D. Mo! Cell Biol 16, 2044-55 (1996); Hampsey, M. & 
Reinberg, D. Curr Opin Genet Dev 9, 132-9(1999)). . In at least some embodiments, the • 
core sequences from Oct-2 (SEQ. ID NO. 3) and VP16 (SEQ. ID NO. 1) activation domains 
1 0 are combined in novel arrangements to maximize their effectiveness. 

It is also contemplated that the effector configurations are not limited to natural 
polypeptide chains. In at least some embodiments, polyglycol spacers are introduced 
between individual core peptide sequences. This increases the conformational flexibility and 
the ability to interact with other proteins without significant gain in molecular weisht. 
1 5 In some other embodiments, the lower mass and/'or higher potency of activation 

domains is achieved by the incorporation of nonnatural amino acids that stabilize the 
secondary structure. For example, short alpha-helical structures can be stabilized with 
introduction of side chain to side chain lactam bridges. These conformational constraints 
may lead to even shorter peptides that retain much of the conformation and activity of longer, 
20 unmodified polypeptide chains. 

In at least some other embodiments of the invention, the effector is selected to repress 
transcription. Attachment of repressor domains, i. e. : synthesis of repressor ATFs, is 
contemplated. Natural transcriptional factors typically contain activation domains or 
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repression domains, and in some cases they may contain both activation and repression 
domains (Liu, Y. Z„ Lee, I. K. : Locke : I., Dawson, S. J. & Laichman, D. S. Nucleic Acids 
Res 26, 2464-72 (1998); Tanaka, M., Clouston, W. M. & Herr, W. Mol Cell Biol 14, 6046- 
55 (1994); Tanaka, M. & Herr, W. Mol Cell Biol 14, 6056-67(1994)). Transcriptional 
factors containing repressor domains function in an analogous manner as transcriptional 
factors containing activator domains, that is, they both need to bring the respective domains 
in the vicinity of the promoter DNA to exert the effect on RNA transcription (Ptashne, "A 
Genetic Switch: Phage Lambda and Higher Organisms. Cell and Blackwell Scientific, 
Cambridge, MA, (1992); and Ptashne et al., "Transcriptional Activation By Recruitment". 
Nature 386:569-577 (1997)). As opposed to transcriptional activators, the binding of a 
transcriptional repressor to the promoter results in the decrease in the levels of RNA 
transcription. This effect is caused by the interference of transcriptional repressor domains 
with the assembly of the Polymerase II holoenzyme complex through a variety of different 
mechanisms. These mechanism range from rearrangements of a cliromatin structure through 
histone deacetylation to interference with action of activation domains (Renkawitz, R. Trends 
Genet 6, 192-7 (1990). Cohen, R. K., Putney, A., Wondisford. F. C. A: Hollenberg, A. UMol 
Endocrinol 14, 900-14 (2000); Busch, K.. Martin, B., Baniahmac. A.. Renkawitz, R. & 
Muller, M. Mol Endocrinol 11, 379-89 (1997)). While the repressor Jomains are very diverse 
and share very few common structural features, for the most p.ir: net effect on a lareet 
gene transcription is similar. 

There are many repression domains that are compatible with ii K - A I i incept, such as 
repression domains derived from Drosophila transcriptional rcprcsM,;,. even-skipped and 
Kruppel, as well as from human repressor protein Madi(Licht, J. D.. Hanna-Rose. W.. 
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Ready, J. C. : English. M. A.. Ro, M. : GrosseL M. ; Shaknovich s R. <£ Hansen, U. (1994) Mo! 
Cell Bio! 34, 4057-66; Margolin, J. F.. Friedman, J. R., Meyer. W. K v Vissing, H., Thiesen, 
H. J. & Rauscher, F. J.. 3rd (1994) Proc Natl Acad Sci USA 91, 4509-13; Han, K. & 
Manley, J. L. (1993) Genes Dev 7, 491-503; Beerli, R. R. : SegaL D. J., Dreier : B. & Barbas, 
5 C. R, 3rd (1998) Proc Natl Acad Sci USA 95, 14628-33; Eilers, A. L. t Billin, A. N. s Liu, J. 
&Ayer : D.E. (1999) J Biol Chem 214, 32750-6) ) .] Although these repressors possess 
different mechanisms of action, they all have the following unifying properties: (i) ability to 
repress a wide variety of activators and act at a distance from the binding sire ;(ii) portability ■ 
(maintaining the function when transferred to a different DNA-binding moiety); and (iii) 

10 relatively small size. For example, it has been demonstrated that a 13 amino acid peptide 
derived from the human protein Madl repression domain is able to confer the repression 
function when fused with a heterologous DNA-binding domain (Eilers, A. L. ; Billin, A. N. s 
Liu, J. & Ayer, D. E. (1999) J Biol Chem 214, 32750-6} Synthesized peptide sequences 
are derived from these repressors and may be used to make repressor ATFs substantially as is 

15 described and shown herein for activator ATFs. 

Transcriptional assays for repression in vitro and in vivo may be performed with 
templates that, for example, contain both ATF and GAL4 binding sites. The repressor ATFs 
are tested for their ability to inhibit the transcriptional activation mediated by GAL4-VP16 
and other strong activation domains fused to the G.AL4 DNA-binding domain. Alternatively, 

20 the assays for repression may be performed with transcriptional templates that contain the 
ATF binding sites incorporated in the promoter having a constitutively high level of basal 
transcription such as the CMV immediate early enchancer-promoter (Schmidt ei al. Mol. 
Cell. Bio! 10. 4404- i 1(1990)). In this case, the binding of the ATF repressor to the promoter 
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will result in the decrease in RNA transcription, and the presence of additional binding sites 
for transcriptional activation is unnecessary. 

The term "artificial repressor" has been used to indicate the strategy that involves 
blocking of the binding of transcription factors to the promoter (Maher el ai, Biochemistry 
5 31:70-81 (1992): Maher et ai, Science 245:725-730 (1989): and Larsen, H.J. and Nielsen, 
P.E., Nucleic Acids Res. 24:458-63 (1996)). This is most often achieved by triple-helix 
forming oiigonucletides (TFOs), peptide-nuleic acids (PNAs) or some other sequence- 
specific non-peptidic DNA-binding molecule designed to compete with a particular 
transcription factor for binding to the same site. The term "passive" repressors is used herein 
) to describe these known artificial repressor molecules and their mode of action. The crucial 
requirement for passive repressor is that its binding site is located very close, or overlapping 
with the binding site for a transcription factor (or some other protein) that is necessary for the 
expression of a given gene. Therefore, the passive repression is possible only on native 
promoters that contain these rare arrangements of overlapping binding sites. In contrast the 
attachment of the "active" repressor moiety to the non-peptidic DNA-binding moiety (as 
described here) will allow for a much greater flexibility and efficiency of transcriptional 
repression by artificial repressors (repressor ATFs). For example, the binding of repressor 
ATF anywhere in the promoter results in repression of transcription because the typical 
repressor domain, like even-skipped or Kruppel, is able to act over a large distance from the 
binding site. Therefore, the precise location of binding sites for repressor ATFs within the 
promoter is not critical for their action as with the "passive" repressors mentioned above. For 
that reason, repressor ATFs allow for the targeting of a much wider variety of genes than 
passive artificial repressors. In other words, the ATF concept can be described as a novel 
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method for the delivery of '"active'' trancriptional effectors (activators or repressors) to the 
promoters in order to regulate the transcription of a target gene in a predictable manner. 
Since ATFs are completely artificial (synthetic) molecules, the present invention allows for 
the delivery to the promoters and the testing of a wide varieiy of completelv artificial 
chemical moieties for the ability to modulate the transcription of the selected (target) gene. 

The DNA binding domain and the effector domain are linked, e.g.. covalently linked, 
through the flexible linker, B. The linker of B portion of an ATF herein is of minimum 
length and maximum flexibility, so that the modulator moiety or domain of the ATF, when 
the DNA binding domain is bound to its recognition siie near a promoter on a DNA template, 
is capable of diffusing relatively freely within a minimum-distance from the DNA template, 
and can interact molecularly with a surface of the various components of the transcriptional 
machinery of the cell more particularly, with a surface of a protein component of the RNA 
polymerase II holoenzyme. Alternatively, the effector may interact with other proteins 
involved in transcriptional regulation such as histones, histone modifying enzymes or other 
transcripiion factors. In at least some embodiments of the invention, the linker is of a 
flexibility and length such that the effector moiety or domain is free to move above the 
surface of the DNA. 

In at ieast some embodiments, the flexible linker of the present invention is at least 
5 A, 10 A or 15 A, or at least 15 A, or at least 20 A ; or at least 28 A in length. The length of 
the linker may be selected to permit accessibility of the effector domain to interaction 
(binding) with RNA polymerase II holoenzyme or other associated proteins, li is recognized 
thai the length of the linker may vary dependent upon, among other factors, the location and 
orientation of the DNA binding domain at the DNA template or the chemical composition of 
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the effector domain. The linker may include at least 10, or at least 20, or at least 30 atoms in 
the chain (backbone) between the two domains, and may include up to 50, or up to 100 
atoms. 

The linking groups is comprised mainly of carbons, hydrogen, nitrogen, oxygen, 
sulfur and phosphorus. Suitable linkers include flexible moieties, such as polyglycols, or 
other polyalkoxy moieties, or oligomers derived from monomers of nucleotides, natural or 
non-natural amino acids and lower alkyls, and preferably oxygen-containing organic 
moieties. The linker is preferably of low molecular weight, chemically inert'and water 
soluble. In at least come embodiments, the linker is an oxygen-containing moiety, 'which 
improves hydrophilicity and is generally desirable for drug development 

The linker is a flexible linkage between the DKA binding domain and the effector 
domain, and is selected such that the linkage between the two domains occurs while the other 
domains continue to perform their intended functions. The linker is a crucial component of 
the ATF composition since it enables the optimal geometric configuration and therefore 
maximizes the potential biochemical activity of the ATFs. The linker is moiety covalently 
attached to the DNA binding domain and the effector domain. For covalent linkage, various 
functionalities may be used, such as amides, carbonic acid derivatives, ethers, esters, 
including organic and inorganic esters, amino, urethane, urea and the like. To provide 
linking, the particular domain e.g. the DNA binding domain or the effector domain, may be 
modified, for example, by oxidation, hydroxylation. substitution, reduction, etc., to provide a 
site for coupling to the linker. The domains may terminate in a reactive amine, carboxylic 
acid, hydroxy, or thiol group, or the like, which are susceptible to conventional peptide 
and/or oligonucleotide reactions. It will be appreciated thai modifications to the domain that 
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do not significantly effect the domain function are preferred. Depending on the domain, 
there may be a number of sites available for coupling through the linker. Any suitable linker 
and linker coupling may be used, so long as it does not interfere with the cellular functions of 
the two domains. 

5 . In at least some embodiments., a linker may be readily included in the DNA or PNA 

strand during synthesis of the DNA or PNA strand in an automated chemical synthesis. One 
moiety that may be so incorporated is polygylcol spacer. For example, a polyglycol spacer is 
attached at the end of the DNA binding domain and a reactive group is provided to the 
terminal end of the polyglycol linker by addition of a modified thymidine bearing a terminal 

10 primary amine (See. FIG. 2). 

In at least some embodiments, a bifunctionai crossiinking agent is used to join the 
linker to either the DNA binding or effector domains. Suitable crossiinking agents include 
small bifunctionai molecules capable linking two target groups. The target groups typicallv 
are the functional groups discussed above. Exemplary thiol-thiol crossiinking groups include 

15 dibromobimane. Exemplar}'' amine-amine crossiinking groups include bis(succinimidy! 
esters), e.g.. bis(succinimidyl esters) of 5 ; 5 : -dithiobis-(2-nitrobenzoic acid), or ethylene 
glycol bisfsucccinic acid). Exemplary amine-thiol crossiinking agents include amine- 
reactive maleimide and iodoacetimide derivatives, such as succinimidyl /ra/7j-4- 
(maleimidylmethy])cyclohexanc-l-carboxylaie s succinimidyl 5-maleimidylbenzoaie. 

20 succinimidyl 6-maleimidylhexanoate. or 4-nitrophenyl iodoacetaie. 

Coupling of amine and carboxylic acid groups may also be facilitated by "zero 
iengtlr crosslinks, a crossiinking agent thai is not incorporated into the final product. 
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Exemplar}- agents include l-ethyl-3-(3-dimethylaminopropyl)carbodiimide and 2-ethoxy-l- 
ethyxocarbony- 1 ,2-dihydroquinoline. 

An exemplar}' ATF is shown in FIG. 2A 5 which was designed and synthesized using 
the components depicted schematically in FIG. 2B. A DNA-binding domain was designed 
based on a modified triple-helix forming oligonucleotide. The 22-base sequence of SEQ ID 
NO: 5 (5TTGTGGTGGGTGGGGTGTGGGT3 ') binds the corresponding target double 
stranded sequence by forming a triple-helical complex at physiological pH (Skoog et al., 
Repression Of Bacteriophage Promoters By DNA and RNA Oligonucleotides. Nucleic Acids 
Res. 21:2131-2138 (1993)). A long (e.g., greater than 15 A) and flexible polyglycol linker 
was introduced to either the 5 : or 3 ' end by automated chemical synthesis. This was 
achieved by synthesizing two molecules of having the base sequence of SEQ. ID NO: 6 
(5TTGTGGTGGGTGGGGTGTGGGTXYC3 5 ) and SEQ. ID NO: 7 
(5'CYXCTTGTGGTGGGTGGGGTGTGGGT3 ? ) : and where the X represents the Spacer 
Phosporamidite 18 (Hexaethyleneglycol spacer, Glenn Research, 22825 Davis Drive. 
Sterling, VA 20164), and the Y represents the thymidine residue bearing the primary amine 
group on a short tether (Amino-Modifier C6-dT. Glenn Research). This primary amine is 
able to react with various kinds of chemicals that do not affect the rest of the molecule. This 
enables the modified residue to serve as an "anchor for the coupling of various functional 
domains ("effectors"), the activity of which can than be tested in transcription assays. The 
effector can be a synthetic peptide, a non-natural peptide (having amino-acids that do not 
occur in nature), or a non-pepiidic organic molecule. 

The ATFs of the invention are useful in modulating the expression of a target gene. 
The target gene can be any gene that is secreted by a cell, so that the encoded product can be 
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made available (or suppressed) ai will. Transeripiion of many genes in vivo are found 10 be 
positively or negatively regulated; while the basal level of other genes., often referred to as 
"housekeeping" genes, can be relatively constant during the lifetime of a cell. Further, 
regulation of any gene can be specific temporally, only expressed in normal cells at a certain 
stage, of development, or can be tissue specific., only expressed in certain tissue or cell or 
organ types. 

A temporally regulated gene, such as the gene for nestin. or for a metalloproteinase 
such as a type II collagenase, may be expressed during fetal development in normal cells, but 
can be up-regulated in a brain tumor or melanoma in the case of nestin, or during metastasis 
of a tumor in the case of type II collagenase. Alternatively, a fetal gene such as the gene for 
embryonic hemoglobin, might be turned on to substitute for insufficient adult hemoglobin in 
a variety of anemia type diseases, such as sickle cell anemia. 

For these and many other applications, promoter specific artificial transcription ? 
factors (activators or repressors) are provided, particularly those of low molecular weight 
capable, of acting as drugs, for example, as negative artificial modulators to turning off 
tumor-specific genes. An ATF useful as a therapeutic drug is of sufficiently low molecular 
weight to be administered orally, and to permeate a target cell in a subject. 

Pharmaceutical Compositions. 

In another aspect, the present invention provides pharmaceuiically acceptable 
compositions which comprise a therapeutical ly-effeciive amount of one or more of the ATF 
compositions of (he present invention, formulaied together with one or more 
pharmaceutical!}- acceptable carrier(s). The pharmaceutical compositions and methods 
describee herein can include one or more ATF compositions of the present invention. 
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The phrase n therapeutically-effective amount" as used herein means that amount of a 
ATF composition, or composition comprising such an ATF composition, which is effective 
for the ATF composition to produce its intended function, e.g., the modulation of gene 
expression. The effective amount can vary depending on such factors as the type of cell 
growth being treated or inhibited, the particular type of ATF composition, the size of the 
subject, or the severity of the undesirable cell growth or activity. One of ordinary skill in the 
art would be able to study the aforementioned factors and make the determination regarding 
the effective amount of the ATF composition without undue experimentation." 

The phrase "pharmaceutical!}' acceptable" is employed herein to refer to those ATF 
composition containing such compounds, and/or dosage forms which are, within the scope of 
sound medical judgment, suitable for use in contact with the tissues of human beings and 
animals without excessive toxicity, irritation, allergic response, or other problem or 
complication, commensurate with a reasonable benefit/risk ratio. 

ATF compositions of the present invention can exist in free form or, where 
appropriate, in salt form. Pharmaceutical!}' acceptable salts and their preparation are well- 
known to those of skill in the art. The pharmaceutical!}' acceptable salts of such compounds 
include the conventional non-toxic salts or the quaternary ammonium salts of such 
compounds which are formed, for example, from inorganic or organic acids of bases. The 
compounds of the invention may form hydrates or solvates. It is known to those of skill in the 
art that charged compounds form hydrated species when lyophilized with water, or form 
solvaied species when concentrated in a solution with an appropriate organic solvent. 

The amount of compound which will be effective in the treatment or prevention of a 
particular disorder or condition will depend in pari on the nature of the disorder or condition, 
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and can be determined by standard ciinical techniques. In addition, in vitro or in vivo assays 
may optionally be employed to help identify optimal dosage ranges. Effective doses may be 
extrapolated from dose-response curves derived from in vitro or animal model test systems. 
The precise dosage level should be determined by the attending physician or other health care 
provider and will depend upon well linown factors, including route of administration, and the 
age, body weight, sex and general health of the individual; the nature, severity and clinical 
stage of the disease; the use (or not) of concomitant therapies; and the nature and extent of 
genetic engineering of cells in the patient. 

The invention also provides a pharmaceutical package or kit comprising one or more 
containers holding one or more ingredients including a precursor composition having flexible 
linker covalenily bound to a DNA binding domain, the DNA binding domain having affinity 
for a DNA binding site on a DNA template sufficient to bind the site and modulate the 
transcription at a promoter precursor composition contains a reactive end group that can be 
used to couple the precursor compound to a test compound of interest for assessing the 
activity of the composition in transcription. The kit also includes transcription mixture 
comprising a DNA template and a eukaryotic UNA polymerase molecule that forms a 
complex with the DNA template. Optionally associated with the Lu ma> be instructions for 
using the precursor composition according to the methods of the -mention. 

Assays. 

The experimental system described herein can be used 1.. \>: : ^tivaiion or 
repression potential of a chemical adduct in vitro; however adorn,. :;.:! embodiments of the 
invention herein include in vivo assays. The experimental system described herein activates 



WO 02/31166 



PCT/US01/32000 



transcription from a linear (typically in in vitro assays) or circular plasmid template that 
could be used both in in vitro and in vivo assays. For example, the binding of ATF to the 
promoter DNA is performed in vitro using circular (plasmid) DNA transcription templates, 
and the resulting pre-formed ATF-template complex is introduced into the tissue culture by 
the various methods of transformation, including transfection, electroporation, liposome 
assisted techniques, etc. The quantification of the resulting RNA transcript from a reporter 
gene transcriptionally downstream from the initiation site (i.e., (3-galactosidase, 
chloramphenicol acetyl transferase, etc.) in vivo reveals the activity of the tested ATF. 
Alternatively, a sample of cells that contain a stable plasmid having the transcriptional 
template is treated with the ATF. In this case, the binding of the ATF to the corresponding 
sites in the template promoter occurs in vivo. Being a relatively small, largely non-peptidic 
molecule, the ATF readily penetrates the cell membranes in an analogous way to penetration 
by antisense oligonucleotides. Alternatively, ATFs are enclosed in liposomes to assist them 
in penetrating the cell membrane. Reporter gene activity in cells in the presence and absence 
of the ATF reveals the extent of transcriptional activation or repression 

The ATF of the invention is used in a method for assaying a test composition for 
activity as a transcriptional modulator. The method includes linking the test compound 
covalently to a flexible linker domain which is covaiently bound to a DNA binding domain 
to provide a test composition, the DNA binding domain having affinity for a DNA binding 
site on a DNA template sufficient to bind the site and to modulate transcription at a promoter: 
contacting the test composition with a transcription mixture including a DNA template, a 
eukaryotic RNA polymerase molecule capable of forming a complex, either directly or 
indirectly through other proteins, with the test composition and the DNA template, a buffer 
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and substrates under conditions suitable for RNa synthesis, such thai RNA is svnthesized: 
and determining the quantity of FLNA produced in the presence of the test composition 
compared to a basa! level in the absence of the test composition, which is a measure of the 
activity of the test composition as a ATF composition. In some embodiments, the DNA 
5 binding site is a plurality of repeats of the binding site sequence 

The synthesized KNA may be quantified using any convention detection system. 
Such systems are for quantitation of RNA product are well known in the prior art. 

. The ATF compositions of the invention could be adopted to develop in vivo 
screening system for novel ATFs as well as for therapeutic applications as described above 
10 (precise regulation of transgenic cells in vivo). For example, instead of transient transfection 
(as described in in vivo experiments herein below), stable transfected cell lines can be 
generated with a reporter construct incorporated into the chromosome. Therefore, any new 
ATF can be tested for the ability to activate or repress this reporter gene. Endogenous genes 
can also be used as targets, however, in this case, the signal is detected on a DNA arrays 
if (chips). 

The high-density DNA and oligonucleotide microarrays ("DNA chips") allow 
monitoring the expression of many different genes simultaneously, which allows extension of 
the in vivo assay because it can provide a number of clues about the effectiveness of a 
particular ATF design. For example, the monitoring of early changes in gene expression 
20 pattern following the treatment of tissue culture cells with ATFs reveals which genes are 
directly affected by the ATF. ATF targets in genome could be identified without prior 
knowledge about the sequences in the promoter. The relative levels of gene expression also 
provides useful information on the activity of a particular effector. In this manner, both 
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DNA-binding and effector domains are characterized in greater detail simultaneously, along 
with the identification of potential gene targets for possible medical applications in the 
future. The microarray analysis is not limited to one type of cell; detection kits are 
commercially available for many different kinds of eukaryotes, from yeast to humans 
(Affymetrix). 

The ATF compositions of the invention may be used in a variety of applications, such 
as the ones described herein above, and in particular may be used in the gene therapy. In 
many instances, the ability to switch a therapeutic gene on and off at will or the ability to 
titrate expression with precision are important for therapeutic efficacy. This invention is 
particularly well suited for achieving regulated expression of a therapeutic target gene in the 
context of human gene therapy. 

The invention is further illustrated in the following examples, which are provided for 
the purpose of illustration only and are not intended to be limiting of the invention, the full 
scope of which is shown in the claims that follow the specification. 

EXAMPLES 

Example 1 . Design and Synthesis of the ATF 

In these experiments, the effector domains were a scries <>: ntk-tic peptides derived 
from the well-studied activation domain of the Herpes simplex \ itui protein VP 16, one of the 
strongest transcriptional activators found in nature. These peptides contain the 29 amino acid 
sequence of SEQ. ID NO: 8 (CGSDALDDFDLDMLGSDALDDFDLDMLGS), which has a 
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cysteine residue attached to two copies of the VP 16 amino acid sequence of SEQ. ID NO: 12 
SEQ ID NO: 8 including the double copy of the VP] 6 amino acid retains about 70% of the 
activity of the full-length VP 16 when fused to the GAL4 DNA-binding domain (Seipel ci ai. 
Different Activation Domains Stimulate Transcription From Remote (Enhancer) .And 
Proximal (Promoter) Positions. EMBOJ. 1 1 :496 1-4968 (1992); and Nyanguile er aL A 
Nonnatura! Transcriptional Coaciivaior. Proc. Nail. Acad Sci. USA 94:13402-13406 (1997)). 
The thiol-bearmg cysteine residue is added at the amino .terminus where it serves as the 
attachment point for conjugation to the rest of the ATF molecule. 

All peptides were synthesized on an automated peptide synthesizer using siiindard 
FMOC chemistry (Barberis ci aL Contact With A Component Of The Polymerase II 
Holoenzyme Suffices For Gene Activation. Cell 81:359-368 (1995)). The chemical 
conjugation of a peptide and the oligonucleotide was accomplished through the use of the 
Afunctional crosslinker. N-hydroxy succinimidy] 6-maleimidylhexanoate ? obtained from 
Pierce Chemical Co. r Rockford, 1L (see FIG. 2A). This commercially available crosslinker 
has two functional groups separated by a six carbon chain: an ester of succinic acid reacts 
specifically with primary amines, and a maleimide functional group that reacts specifically 
with thiols under certain conditions. The coupling reaction was performed in successive 
steps. Since the ester group was the more labile of the two., the crosslinker was first coupled 
to the primary amine in the oligonucleotide. Excess crosslinker was removed by chloroform 
extraction and ethanol precipitation. The second step involved the reaction between the thiol 
group in the peptide and the maleimide functional group of the crosslinker. The purification 
of the conjugate from the excess peptide and the unreached DNA was accomplished bv 
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reverse-phase HPLC. The purified aliquots were dried in a lyophilizer and subsequently 
dissolved in ultra-pure water (Ambiom Austin, TX) and stored at -70°C. 

Example 2. In vitro Transcription Assavs 

Two 26-base oligonucleotides 5 'having the sequence of SEQ. ID NO: 9 and SEQ. ID 
NO: 10 were annealed and ligated to yield a series of double-stranded DNA fragments 
containing multiple binding sites for triple-helix formation. A fragment containing 5 copies 
of the site as shown in the duplex of SEQ ID NO: 9 and SEQ ID NO: 10 was purified by 
agarose gei electrophoresis and inserted into the Hindlll restriction site in the polylinker of 
an embodiment of the G5E4T series of transcription templates. 

FIG. 3 shows the promoter regions of the transcription templates. In FIG. 3 A, the 
control template contains promoter to modulated transcription five GAL4 and five ATF 
binding sites incorporated in the promoter at -53 and -155 bp relative to the +] transcription 
start site, respectively; and in FIG. 3B the ATF assay template contains five ATF binding 
sites incorporated at -65 bp. 

The resulting plasmids were linearized by digestion with EcoRL purified and the 
linear templates were stored in ultra-pure water at -20°C. The transcription assay was 
initiated by incubation of 200 ng of linearized transcription templates with ATFs in the 
binding buffer containing 10 mM Tris pH 8 ; 40 mM MgCN and 1 00 mM KC1 at room 
temperature for 24 hours. The total volume of binding reaction was 5 microliters. After 
binding reaction, salt and buffer conditions were adjusted by the addition of 5 microliters of 
water. 6 microliters of IX HeLa buffer (20 mM HEPES pH 7.9, 100 mM KCl 0.2 mM 
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EDTA. 0.5 mM DTT, 20% glycerol) and 5 microliters of HeLa nuclear extract (Promega. 
Madison, WI). An aliquot of GAL4-VP16 protein (1 microliter total) was added 10 the 
control templates and all reactions were incubated for 1 0 minutes at 30°C. The transcription 
was initiated by the addition of 1 microliter of a mixture of ribonucleotide triphosphates 

5 (ATP, CTP : GTP, 10 mM each) and 0.5 microliters of 32 P-a-UTP (MEN, Boston. MA). After 
farther incubation at 30°C for 30 minutes all transcription reactions were terminated by the 
addition of 100 microliters of stop buffer (0.5 M sodium acetate, 0.2% SDS, 10 mM EDTA. 
1 microgram/m! glycogen, 1 microgram/ml Proteinase K). The reactions were vortexecL 
incubated at 37°C for 10 minutes, extracted with phenol/chloroform, and precipitated with 

10 250 microliters of ice-cold ethanol. The pellets were dissolved in 20 microliters of standard 
denaturing formamide/dye mix and were loaded and run on a denaturing 6% polyacrylamide 
gel. 

The results are shown in FIG. 4. The lanes 1 and 2 show the activation by the control 
fusion protein GAL4-VP16 from control templates (FIG. 4A) ; while all of the other lanes 

15 represent the activation by the ATF from the ATF assay template (FIG. 4B). The 250 base 
transcript initiated at the correct initiation site in each sample is indicated by the arrow. Lane 
1 shows the level of basal transcription, in the absence of any transcription factor, from the 
control template. Lane 2 shows that the addition of the control fusion protein GAL4-VP16 
results in marked increase of the quantity of transcription from the control template. Lane 3 

20 shows basai transcription from the ATF assay template. Lanes 4-7 show, as expected, 

transcription activation as a function of increasing amounts of ATF result. Lanes 5, 6, and 7 
contain five-fold, 100-fold, and 500-fold more, respectively, ATF than lane 4. The curve 
shows results in a "bell curve ,: response. At the optimal concentration of ATF (lane 5), the 
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level of transcriptional activation is equal to, if not higher than, the maximum levels of 
activation observed from the control template in the presence of control activator protein 
GAL4-VP16. 

The results in FIG. 4 show that the natural activator VP 16 fused to the GAL4 binding 
site to produce the fusion GAL4-VP16, strongly activated the transcription from the control 
template containing five GAL4 binding sites. The run-off transcript initiated at the -rl 
transcription start site is 250 bp long (indicated by arrow, Figure 4). At the same time, 
ATF(29) is able to activate the transcription from templates containing five ' ATF binding 
sites in the promoter (Figure 4). The activity of ATF(29) is absolutely dependent on the 
presence of the corresponding binding sites in the promoter as it does not activate 
transcription from the control templates or other templates lacking ATF binding sites (data 
not shown). The comparison of RNA transcripts confirms that both GAL4-VP16 and ATF 
initiate the transcription from the same, "correct" (+1) site in the promoter adjacent to the 
TATA box. Most unexpectedly, the maximum level of transcriptional activation by 
ATF(29) was comparable to the level that could be obtained with GAL4-VP16. For 
example, the quantitation of RNA transcripts reveals that GAL4-VP 1 6 and ATF(29) activate 
transcription 30-40 fold above basal level (Figure 4). This result corresponds to previously- 
reported maximal levels of in vitro transcriptional activation by GAL4-VP 1 6 from linearized 
templates containing 5 GAL4 binding sites (Haile. D. T. & Parvin, J. D. (1 999) J Biol Chem 
274, 21 13-7)These levels of activation are obtained with ATF concentrations between 2-40 
nM, and further increase in concentration leads to diminishing activation (Figure 4). This 
data also indicates another similarity of ATFs with strong natural activators: the ability to 
"squelch" transcriptional activation, presumably by sequestration of the target components of 



-35- 



WO 02/31166 



PCT/US01/320W) 



the transcriptional machinery by excess amounts of strong activators (Ptashne. M. (1988) 
Nature 335. 683-9).Both the activation and the squelching ability are completely dependent 
on the presence of the attached AD peptide because the "truncated" ATFs lacking the AD 
proved completely inactive. 

A. Significance of fully functional highly potent ATFs. 

While there are many successful drugs that target gene products, thus far there is no 
effective and practical strategy for pharmaceutical targeting of the gene itself. The 
manipulation of gene expression at the level of transcription has a great potential for drue 
development because basic regulatory mechanisms are common for a great majority of 
eukaryotic genes (Roberts, S. G. (2000) Cell Mo! Life Scf 57, 1 149-60). In principle, a small- 
molecule drug designed to modulate transcription of a specific gene can be applied to cause a 
similar effect on a different gene with only small modifications in its DNA-binding moiety. 
Therefore, cell permeable drugs able to turn specific genes on or off will form the basis for a 
universal therapeutic strategy, applicable to many different genes of medical interest. 

The progress in this field has been hampered by the fact that designing an effective 
ATF proved to be a major challenge. For example, a previous attempts to combine synthetic 
peptides with polyamide DNA binders produced transcriptional activators that bind DNA via 
a minor groove and exhibit very low activity in in vitro assays (Mapp et ah supra). Since 
these studies employ only the reporter gene containing G-less cassette, it is not clear whether 
the overall increase in RNA is due to the specific or non-specific (i.e. random) transcript 
initiation. 

However, to demonstrate the usefulness of ATF as novel tools and develop the first 
practical applications, it is necessary to achieve a biological activity and specificity to 
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approach that of common!}' used natural transcription factors. Results of the present study 
show for the first time that it is possible to design a fully functional ATF that (i) initiates 
RNA transcription from the correct site and (ii) possesses biological activity comparable to 
strong transcriptional activator proteins like GAL4-VP16. 

B. Activation potency depends on A TF chemical structure. 
The design of highly potent ATF was achieved througLseveral iterations of 
molecular design and synthesis. Without being bound by any particular mechanism, the 
chemical configuration (in particular , length and flexibility) of the linker likely plays a 
major role in the optimal presentation of the effector to the transcriptional apparatus. 
Namely, the combination of a long and flexible polyglycol chain with a more rigid and 
hydrophobic nucleotide- crosslinker structure seems be able to mimic the molecular 
geometry of transcription factors particularly well. In fact, the in vitro transcription assays 
described in FIGS 3 and 4 imply that the ATFs could be even more potent transcriptional 
activators than GAL4-VP16. For example, GAL4-VP16 protein binds to each of the five 
sites in the promoter as a dimer, while ATF molecules bind the corresponding triple-helix 
target sites as monomers (Carey, M.et al (1989) J Mol Biol 209, 423-32). . However, five 
ATF molecules bound to the promoter elicit a similar effect on transcription in vitro 
compared to ten GAL4-VP16 molecules (Figure 4). This high potency of ATF molecules 
may be due io the simple extended chemical structure involving no bulk}' protein domains 
that leaves effectors much more exposed to interaction with RNA Polymerase II holoenzyme 
and/or other proteins. The exposed effector would in effect facilitate the recruitment of the 
holoenzyme to the promoter and the initiation of RNA transcription. This conclusion is in 
accord with previous work showing that 14-mer peptide derived from VP 16 (Figure 2A) is 
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inactive when fused to the GAL 4 DNA binding domain through recombinant DNA 
techniques (Seipel et al, supra).. In contrast, the present study demonstrated that the same 14 
mer peptide sequence shows a remarkable biochemical activity within the context of ATF. 

5 Exam ple 3. An ATF Having the Linker Covalentlv Attached to the 3 : Terminus of 

the DNA Binding Domain 

The ATF in FIG. 2A : which is a strong positive transcriptional effector (FIG. 4, lanes 
3-7).. has a DNA binding domain A, covalentlv attached to a linker B, via the 5 ; end of the 

1 0 DNA binding domain. The polylinker was attached at the 3 ! end, and this ATF too had 

strong positive transcription effector activity (FIG. 4, Lanes 8-9). Lane 8 shows transcription 
mediated by the 3 : A-B-C structured ATF.. and lane 9 shows transcription by a five-fold 
increase in quantity. The amount of 250 bp run-off transcript found in lane 8, compared to 
that in iane 3 showing basal levels in the absence of any effector for the template, indicates 

! 5 that the covalent linkage of a flexible polylinker to the activation domain provides a 

configuration which is a powerful ATF. [The presence of a linker plays a role in the positive 
function, when contrasted to other configurations (Kuznetsova et al.. supra).] 

Example 4. An ATF Having L-and-D Amino Acids Activating Domains and V and 
20 5 T Linked DNA Binding Domains. 
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This example shows the versatility of the ATF compositions of the invention and 
establishes functionality for both L- and D- versions of the activator for domain as well as 3' 
and 5' linked DNA-binding domains. 

A. Synthesis of ATFs 

The introduction of a linker into the TFO was achieved by synthesizing two 
molecules having the sequence of SEQ ID NO: 6 (for 5' ATFs) and SEQ ID NO: 7 (for 3' 
ATFs) where X represents the Spacer Phosporamidite 18 (Glenn Research) and the Y 
represents the modified T residue bearing the primary amine group on a short tether (Amino- 
Modifier C2-dT, Glenn-Research). 

The two different forms (L- and D-) of the 29-mer peptide having the sequence of 
SEQ ID NO.8 amino acids by standard methods (Nyanguile et ah Proc. Nail. Acad. Sci. USA 
94: 13402-6 (1 997); and Stanojevic et al Natl. Struct Biol. 2: 450-457 (1 995)). The 
chemical conjugation of a peptide to the rest of the ATF molecule was accomplished through 
the use of a Afunctional crossiinker succinimidyi 6-maleimidylhexanoaie (EMCS.. Molecular 
Probes) as described above in Example 1. 

B. Transcription assays. 

The templates for in vitro transcription were made by inserting 5 direct repeats of a 
triple helix binding DMA sequence having a sequence of SEQ ID NO: 1 1 
(5-TTCTCCTCCCTCCCCTCTCCCTCTT3') into the Hind III restriction site in the 
polylinkcr ofthe GnE4T series of transcription templates (Lin et al. Cell 5A: 659-664 
)1988)) plasmids were linearized by digestion with EcoRI and/or Hind 111 and the bindin" of 
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ATFs to the template was performed by incubating 20-10(1 ng of linearized ATF transcription 
templaies with ATFs in the binding buffer containing 10 mM Tris pHS. 40 mM N4gCI 2 and 
100 mM KC1 at room temperature for 24 hours. Ample incubation time was permitted for 
optimal complex formation between ATFs and the promoter. Subsequently, an aliquot of 
5 diluted GAL4-VP16 protein (1 microliter total) was added to the control transcription 

templates and all reactions were incubated for 10 minutes at 30 degrees C. The transcription 
reactions were performed with HeLa crude nuclear extract (Promega) using the standard 
procedures (Promega protocols). The final RNA transcripts were resolved on "6% denaturing 
polyacrylamide gel. Dried gels were exposed on both Kodak Biomax MS film and Fuji 
10 Phosphoimager plates. The quantitation of signals was performed with Fuji MacBAS 
software. 

FIG. 5 shows a side-by-side comparison of transcription activation by GAL4- 
VP16and the following ATFs: 3" ATF, 5' ATF (D) and 3" ATF (D). Comparison of lanes 
2.4.5.6 and 7 indicates that all ATFs are able to activate transcription at least as strongly as 

15 GA14-VP 16 in vitro. The comparison of activation by 3 : ATF and GA1 4-VP1 6 reveals that 
3' ATF also has a very strong effect on transcription. Despite having slightly different 
chemical configuration, 5 : ATF and 3 s ATF are very similar in their biological effects, 
namely, activation strength and the squelching ability. Furthermore, the results in FIG. 5 
demonstrate thai ATFs synthesized with activation domain consisting of D amino acids are 

20 also potent transcriptional activators. It is evident that both 5 : ATF(D) and 3 s ATF(D) 

activate the transcription from the correct initiation site resulting in a 250 bp RNA transcript 
that is virtually indistinguishable from those generated by other ATFs or by GA14-VP16. In 
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VP16.. 



WO 02/31166 



PCTAJS01/320IM) 



Example 5. In vivo Transcription. 

A. Synthesis of the ATF. 

The ATF molecule was prepared substantially as described in Example 1. The ATF 
structure included the 22-mer triple-helix forming oligonucleotide (TFO) of SEQ. ID NO: 5. 
which has been shown to form a stable triple-helical complex with target DNA at 
physiological pH. A long and flexible polyglycol linker is inserted either near the 5 T end of 
the TFO. The distal end of the linker bears a modified thymidine residue containing the 
primary amine group on a short, two-carbon chain tether. This primary amine serves as an 
anchoring site for the coupling of transcriptional activation domain (AD). The AD consists 
of a 29-mer (SEQ ID No:8) or 14-mer peptide (SEQ ID NO: 12) sequence derived from 
Herpes simplex viral protein VP 16. A thiol-bearing cysteine residue is incorporated at the 
amino terminus to allow for a covalent linkage with the rest of the ATF molecule. The 
chemical conjugation of the peptide to the rest of ATF molecule was accomplished through a 
bifunctional crosslinker. The coupling reaction resulted in two forms of ATF molecule 
ATF(29) carrying 29 amino acid VP 16 peptide, and ATF(14), earn mi; 14 amino acid VP 16 
peptide. 

B. In vivo ATF Activation. 

The ability of ATFs to activate transcription in vivo wa> i-\.ir\r.::J\ h\ transient 
cotransfection assays in BHK-21 tissue culture cells. As a conuul iv;:s:\.!ij . u reporter 
construct containing the minimal HSV thymidine kinase promoicr dm 11.7 ihi- expression of 
chloramphenicol acetyl transferase (CAT) reporter gene was used. The ^inscription 
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template was constructed by incorporating the oligonucleotide with 5 copies of ATF binding 
sites in between the Hindlll and BamHI restriction sites of the polylinker upstream of the 
control template promoter (FIG. 6 A). The transfection mixture contained I microgram of the 
piasmid DNA and 50 nM ATFs. Transections were performed by using polycationic lipid 
LipofectAMINE (Life Tech.). Cells were harvested 48 hours after transfection and CAT 
assays were performed using FAST CAT (Molecular Probes, Inc.) as substrate, and 
visualized and quantitated on a Fluorimager (Molecular Dynamics). Each reporter construct 
shown in FIG. 6A was transfected into cells with and without ATF molecules'and the 
activation signal was measured with standard methods. 

The results show, in FIGs. 6B and 6C reveal that-ATF(29) activates transcription 5- 
fold above the basal level. At the same time, the ATF(14) caused nearly 30-fold activation of 
transcription from the same template. This effect is sequence-specific since none of the 
ATFs were able to activate transcription from the control template lacking the ATF binding 
sites. Similarly to in vitro assays, the intact ATF structure is necessary for in vivo function as 
well; namely, ATFs having no peptide attached were not able to activate the transcription 
from either template. 

The results of co-transfection assays in tissue culture cells demonstrate for the first 
time the substantial biochemical activity of ATFs in an intracellular environment, and 
further confirm the validity of our design. The ATFs bind to the promoter and elicits a strong 
(up to 30-fold) and sequence-specific activation of transcription from the reporter gene upon 
introduction into tissue culture cells (FIGS. 6B and 6C). The apparent ability of ATF(14) to 
cause a much stronger effect in vivo than ATF(29) is likely due to the differences in cell 
permeability. Since ATF(14) contains the shorter peptide effector with lower electrostatic 
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charge, ii is expected to possess an increased cell permeability compared to the larger 
ATF(29). The magnitude of transcriptional activation signai obtained with ATFs should be 
taken in context of the specific experimental set-up. Namely, while in vitro assavs allow the 
direct comparison between synthetic and natural transcription factors, it would be very 
difficult to interpret such experiments in tissue culture assays. Namely, transfection assavs 
require the introduction of GAL4-VP16 expression constructs into the living cells, resultina 
in continuous synthesis of the GAL4-VP16 protein inside the cell. This would be in stark 
contrast with the introduction of ATF from through the cell membrane, generally at the start ' 
of the experiment. Therefore, the calibration of intracellular concentrations of natural vs. 
synthetic activators over the course of the experiment would be very difficult. 

Example 6. Methods for Testing Low Molecular Weight Potential Drugs 

The efficiency and potency of an embodiment of the ATFs described here, having 
VP 16 amino acid sequences, indicates that by varying the activator chemical moiety, 
covalently attached as an abduct to the flexible polylinker. in place of the VP16-derived 
effector domain of the ATF, various chemicals could be tested in transcription assays. These 
chemicals can include low molecular weight potential dings, having a molecular weight of 
less than 3,000 or less than 1,500 daltons. Preliminary results suggest that a wide varietv of 
molecules may be able to mimic the function of activation or repression domains. For 
example, the 29 amino acid VP 16 sequence synthesized using one or more non-naturallv 
occurring D-amino acid shows some activation potential as compared to the sequence made 
from naiural, L-amino acid residues (FIG 5). Using the methods provided herein, the ability 
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to activate or repress transcription can be shown not to be limited to a moiety having a 
structure composed of peptides, and that non-peptidic moleculesand efficient activation or 
repression domains. 

The non-peptidic nature of the DNA-binding domain of other potential ATFs 
5 envisioned herein, along with the linker and small molecule putative effector domains, 
enables development of smaller, cell-permeable molecules with therapeutic pharmaceutical 
potential. The chemical structure and composition of the embodiments of the ATF herein 
could serve as a starting point in development of. or screening for. novel classes of drugs for 
manipulation of gene expression at the level of transcription. 

0 

Those skilled in the art will recognize, or be able to ascertain, using no more than 
routine experimentation, numerous equivalents to the specific substances and procedures 
described herein. Such equivalents are considered to be within the scope of this invention, 
and are covered by the following claims. 
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What is claimed is: 

i . A composition for modulating transcription of a eukaryoiic gene, comprising 
a non-peptidic DNA binding domain, 
a transcripiional effector, and 

5 a flexible linker, one end of the linker being covalently bound to the DNA binding 

domain, and the other end of the linker being covalently bound io the transcriptional 
activator. 

2. The composition of claim 1. wherein the effector activates transcription. 

3 . The composition of claim L wherein the effector represses transcription. 

10 4. The composition of claim L wherein the transcriptional effector binds a co- 

effecting protein, e.g. a repressor or activator co-effecting protein. 

5. The composition of claim 3 3 wherein the transcriptional repressor binds a 
histone protein. 

6. The composition of claim 1. wherein the DNA binding domain is a nucleic 

3 5 acid. 

7. The composition of claim 6 : wherein the nucleic acid is a modified nucleic 

acid. 

8. The composition of claim 7, wherein the modified nucleic acid is selected 
from a group of a nucleic acid having a modified backbone, a nucleic acid having a modified 

20 base, and combinations thereof. 

9. The composition of claim 8, wherein the modified backbone is selected from 
the group consisting of phosphorothioates and peptide nucleic acids. 

10. The composition of claim 1. wherein the DNA binding domain is a peptidic 
nucleic acid. 
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1 1 . The composition of claim 6 3 wherein the linker is bound to 3' the end or the 
end of the nucleic acid. 

12. The composition of claim 1, wherein the DNA binding domain does not 
contain a plurality of pyrrole or imidazole groups. 

13. The composition of claim 1, wherein' the transcriptional effector is a 
polypeptide sequence. 

14. The composition of claim 2, wherein the transcriptional activator is a 
polypeptide sequence, and the polypeptide sequence comprises at least one copy of an 
activator sequence of amino acids from herpes virus protein VP 16 selected from the group 
consisting of SEQ ID NO:L SEQ ID NO: 2, SEQ ID NO: 8 and SEQ ID NO: 12.. 

15. The composition of claim L wherein the effector is an organic moiety. 

16. The composition of claim 13, wherein the amino terminus of the sequence is 
covalently bound to the linker. 

17. The composition of claim 13, wherein the transcriptional effector moiety has 
molecular weight of less than about 3,000 daltons. 

1 8. The composition of claim 13, wherein the transcriptional effector moiety has 
molecular weight of less than about 1.500 daltons. 

1 9. The composition of claim 1 . wherein the flexible linker comprises a 
polyglycol. 

20. The composition of claim 19, wherein the flexible linker polyglycol contains 
at least six glycol units. 
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2 1 . The composition of claim 1 9. wherein the flexible linker comprises monomer 
units selected from the group consisting of nucleotides, peptides, lower alky Is and oxy sen- 
containing alkyl groups. 

22. The composition of claim 1. wherein the flexible linker is at least 1 5 A in 

length. 

23. The composition of claim L wherein the flexible linker is at least 28 A in 

length. 

24. The composition of claim 1. wherein the amount of transcription initiated on a 
linear double-stranded DNA template is at least ten-fold greater compared to a second 
amount initiated in the absence of the composition. 

25. The composition of claim 1. wherein the .amount of transcription iniiiated-on a 
linear double-stranded DNA template is thirty* to fifty-fold greater compared to the second 
amount in the absence of the composition. 

26. The composition of claim L wherein the DNA binding domain has affinity for 
a DNA binding site on a DNA template, wherein the DNA template is less than about 500 
base pairs in length. 

27. A composition for activating transcription having the structure A-B-C. 
wherein A is a triplex-forming nucleic acid, B is a flexible linker, and C is an effector moiety 
that binds to a site on a transcriptional protein complex comprising an eukaiyotic RNA 
polymerase, wherein B is covalently linked to A and C. 

2S. The composition of claim 27, wherein B is a polyglycol chain, and the 
covaleni linkage of B io C includes a bifunctional crosslinking agent. 
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29. The composition of claim 27, wherein B comprises a polyglycol which is at 
least about 28 A in length, and C comprises an amino acid sequence from herpes virus 
protein VP 16. 

30. A method for assaying a test composition for activity as a transcriptional 
modulator, comprising: 

providing a cell comprising a target gene under the control of at least one 
transcriptional regulatory element and having a DNA binding site; 

contacting the cell with a test composition comprising a DNA binding domain, a test 
moiety of interest and a flexible linker, one end of the linker being covalently bound to the 
DNA binding domain, and the other end of the linker being covalently bound to the test 
compound, wherein the DNA binding domain has affinity for the DNA binding site, under 
conditions which allow transcription to occur; 

detecting any changes in the level of transcriptional activity of the target gene in the 
presence of the test composition compared to the basal level in the absence of the test 
composition, which is a measure of the activity of the test composition as a transcriptional 
activator. 
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31. A method for assaying a test composiiion for activity as a transcriptional 
modulator, comprising: 

providing a test composition comprising a DNA binding domain, a test moiety of 
interest and a flexible linker one end of the linker being covalently bound to the DNA 
binding domain, and the other end of the linker being covalently bound to the test compound, 
the DNA binding domain having affinity for a DNA binding site on a DNA template 
sufficient to bind the site and activate transcription at a promoter; 

contacting the test composition with a transcription mixture comprising a DNA 
template, a eukaryotic RNA polymerase molecule capable of forming a complex with the test 
composition. and the DNA template, a buffer and subsirates under conditions suitable for 
RNA synthesis, and a detection system for quantitation of RNA product, such that RNA is 
synthesized; and, 

detecting any changes in the level of transcriptional activity of the target gene in the 
presence of the test composition compared to the basal level in the absence of the test 
composition, which is a measure of the activity of the test composition as a transcriptional 
activator. 

32. The method of claim 30 or 31. wherein the test moiety is a chemical moiety 
and wherein the test composition has a molecular weight of less than about 3 kD. 

33. The method of claim 30 or 31. wherein the test moiety is a chemical moiety 
and wherein the test compound has a molecular weight of less than about 1 .5 kD. 

34. The method of claim 30 or 51. wherein the test moiety is membrane-permcant. 
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35. The method of claim 30 or 3 1, wherein the test moiety is suspected of having 
transcriptional activation or repressor activity. 

36. The method of claim 30 or 3 1 9 wherein the changes in transcriptional activity 
are detected as variations in observed levels of mRNA, or protein product encoded by the 
target gene. 

37. The method of claim 30 3 wherein the target gene is selected from the group 
consisting of a gene encoding a protein conferring resistance to a drug, a gene encoding an 
enzyme, a gene which rescues an auxotrophic phenotype, and a gene encoding a cell surface 
antigen. 

38. The method of claim 37, wherein the target gene encodes a protein which 
provides for calorimetric, luminescent or fluorescent detection. 

39. The method of claim 30 or 3 1, wherein the binding site of the iest composition 
io tne DNA template is located within 100 base pairs of the site for initiation of transcription. 

40. The method of claim 31, wherein the DNA binding site comprises a plurality 
of repeats of the binding site sequence. 

41. The method of claim 30, wherein the step of providing the test composition 
with a transcription mixture is performed in vivo. 

42. The method of claim 31 , wherein the step of providing the test composition 
with a transcription mixture is performed using high throughput screening technologies 
comprising robotized sample distribution into multiwell dishes. 
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43. The method of claim 42, wherein the step of determining the quaniirv of 
transcript ion is performed using the high throughput methods of detection wherein automated 
piaie readers have programmable computerized programs for data analysis and display. 

44. The method of claim 3 L wherein the DNA binding domain has affinity for a 
DNA binding site on a DNA template, wherein the DNA template is less than about 500 base 
pairs in length. 

45. A method of altering transcriptional activitity, comprising: 
introducing into a cell a transcriptional composition comprising a DNA binding 

domain, a flexible linker, and a transcriptional modulator, one end of the linker being 
covalently bound to the DNA binding domain, and the other end of the linker being 
covalencly bound to the transcriptional modulator. 

46. The method of claim 45 T wherein the modulator is an activator or a repressor. 

47. A kit for assaying a test composition for activity as a transcriptional 
modulator comprising: 

a flexible linker covalently bound to a DNA binding domain, the DNA binding 
domain having affinity for a DNA binding site on a DNA template sufficient to bind the site 
and activate transcription at a promoter; and 

a transcription mixture comprising a DNA template and a eukaryotic RNA 
polymerase molecule that forms a complex with the DNA template. 

48. The kit of claim 47. wherein the modulator is an activator or a repressor. 
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